Peer-to-Peer content search supported by a distributed index in a publication/search model

نویسندگان

  • Gabriel Tolosa
  • Jorge A. Peri
  • Fernando Bordignon
چکیده

1 Research fellow at Secretaría de Investigación y Postgrado. Universidad Nacional de Luján. Argentina. ABSTRACT: Peer-to-peer networks (P2P) are considered a valid approach for the construction of distributed systems. Further research projects in the last few years have focused on using this kind of networks as an alternative for solving different situations that have traditionally required centralized servers, such as search engines. This paper deals with the problem of content search in highly distributed and dynamic environments. We propose and evaluate a distributed index model built upon a peer-to-peer network which supports complete indexing of text documents and allows searching by content. A distinctive feature of this proposal is that it requires no specific network topology or hierarchy. Evaluations with different settings were performed by simulating a 10,000-node network, where each node had the capability to share documents.With regard to the traffic generated, experiments show an improvement in efficiency of between 84% and 93% over similar systems like Gnutella. The evaluation of retrieval performance using a test collection showed that the P2P system was able to achieve the same level of performance as the centralized system. It was also found that the amount of traffic generated by this model varies between 80 and 225 Kb per set of query and answers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Review on the Editorial Peer Review

Background and Objectives: The editorial peer review has an important role in the publication of scientific articles. Peers or reviewers are those scholars who have the expertise regarding the topic of a given article. They critically appraise the articles without having any monetary incentives or conflicts of interest. The aim of this study was to determine the most important aspects of the ed...

متن کامل

Query-Driven Indexing in Large-Scale Distributed Systems

Efficient and effective search in large-scale data repositories requires complex indexing solutions deployed on a large number of servers. Web search engines such as Google and Yahoo! already rely upon complex systems to be able to return relevant query results and keep processing times within the comfortable sub-second limit. Nevertheless, the exponential growth of the amount of content on the...

متن کامل

A Scalable Distributed Data Structure for Multi-Feature Similarity Search

Similarity search for content-based retrieval (where content can be any combination of text, image, audio/video, etc.) has gained importance in recent years, also because of the advantage of ranking the retrieved results according to their proximity to a query. However, to use similarity search in real world applications, we need to tackle the problem of huge volumes of such mixed multimedia da...

متن کامل

Proof: A Novel DHT-Based Peer-to-Peer Search Engine

In this paper we focus on building a large scale keyword search service over structured Peer-to-Peer (P2P) networks. Current stateof-the-art keyword search approaches for structured P2P systems are based on inverted list intersection. However, the biggest challenge in those approaches is that when the indices are distributed over peers, a simple query may cause a large amount of data to be tran...

متن کامل

Efficient Peer-to-Peer Keyword Searching

Today, exponential growth in network content makes it difficult to build and maintain a complete document index to support efficient search. Centralized search services must actively and repeatedly probe the network for new or changed content. The scope and rapid evolution of the Internet means that even the best pull-based search services will always be incomplete and inaccurate. Recently, how...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JDIM

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2006